Export LLM documentation revamp #12381

jackzhxng · 2025-07-10T23:44:34Z

Summary

Structure:

New Getting Started page
AOT (export)
- Old getting started page, which was the NanoGPT tutorial, is moved to export-custom-llm.md, with the runner sections removed to add to the run-with-c-plus-plus.md
- New export-llm.md page for exporting LLMs with export_llm API
Runtime
- iOS/Android app docs remain, they detail steps to take after the .pte is generated for running on-device
- Added a C++ runner page for @larryliu0820 to fill out with the new runner APIs
- Since the QNN Llama tutorial is highly custom, we are going to leave the export section in it as well instead of dividing like we did for the rest of the tutorials

pytorch-bot · 2025-07-10T23:44:38Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12381

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures

As of commit 1a3e422 with merge base dd4488d ():

NEW FAILURES - The following jobs have failed:

Build Presets / linux (llm, linux.arm64.2xlarge, executorch-ubuntu-22.04-gcc11-aarch64) / build (gh)
pull / android / run-emulator (gh)
The process '/usr/bin/sh' failed with exit code 255
pull / test-eval_llama-mmlu-linux / linux-job (gh)
RuntimeError: Command docker exec -t efcb5ae99e0012abf33ac56e93396442c77149c82b507bd1c031b7aaf28800d7 /exec failed with exit code 1

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2025-07-10T23:45:13Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

docs/source/llm/getting-started.md

jackzhxng

Thanks for adding the runner doc @larryliu0820

docs/source/llm/run-with-c-plus-plus.md

docs/source/llm/export-llm.md

docs/source/llm/export-custom-llm.md

docs/source/llm/export-llm.md

shoumikhin · 2025-07-16T23:21:54Z

Please make a placeholder for ObjC/Swift runtime APIs and i'll fill it out

jackzhxng · 2025-07-16T23:42:30Z

@shoumikhin I have left the existing llama-demo-ios.md untouched and linked to it from getting-started.md, feel free to make any changes to llama-demo-ios.md that you need

mergennachin

export-llm is a great page, thank you for working on this

docs/source/index.md

docs/source/llm/getting-started.md

docs/source/llm/export-llm.md

shoumikhin · 2025-07-18T23:16:20Z

Instead of llama-demo-ios.md we should probably just point to examples/demo-apps/apple_ios/LLaMA/README.md?
I meant some place in the docs where we describe the LLM runtime API in C++ and how to use it. I can append some info on ObjC/Swift API there.

jackzhxng · 2025-07-19T00:22:33Z

@shoumikhin sure, feel free to open a PR against this!

shoumikhin · 2025-07-19T01:16:04Z

docs/source/index.md

+- [Running with C++](llm/run-with-c-plus-plus.md)
+- [Running on Android (XNNPack)](llm/llama-demo-android.md)
+- [Running on Android (QNN)](llm/build-run-llama3-qualcomm-ai-engine-direct-backend.md)
+- [Running on iOS](llm/llama-demo-ios.md)


Can you create a placeholder page like that similar to Running with C++ one?
I'll make it refer llama-demo-ios.md which seems to redirect to the readme already.

Suggested change

- [Running on iOS](llm/llama-demo-ios.md)

- [Running on iOS](llm/run-on-ios.md)

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 10, 2025

jackzhxng force-pushed the jz/export-llm-docs branch 20 times, most recently from 89e52e9 to 0ffe50a Compare July 11, 2025 22:13

jackzhxng requested a review from larryliu0820 July 11, 2025 22:15

jackzhxng force-pushed the jz/export-llm-docs branch from 0ffe50a to dc267a8 Compare July 14, 2025 18:36

jackzhxng changed the base branch from main to export-llm-docs July 14, 2025 18:36

jackzhxng marked this pull request as ready for review July 14, 2025 18:36

jackzhxng requested review from JacobSzwejbka, mergennachin and Gasoonjia as code owners July 14, 2025 18:36

jackzhxng removed request for swolchok, digantdesai, shoumikhin, SS-JIA, cccclai, kimishpatel, Gasoonjia, mcr229, manuelcandales and kirklandsign July 14, 2025 18:47

mergennachin reviewed Jul 14, 2025

View reviewed changes

docs/source/llm/getting-started.md Outdated Show resolved Hide resolved

jackzhxng force-pushed the jz/export-llm-docs branch 4 times, most recently from 8753f2b to 6af121b Compare July 14, 2025 23:42

jackzhxng commented Jul 16, 2025

View reviewed changes

metascroy reviewed Jul 16, 2025

View reviewed changes

docs/source/llm/export-custom-llm.md Show resolved Hide resolved

metascroy reviewed Jul 16, 2025

View reviewed changes

docs/source/llm/export-custom-llm.md Show resolved Hide resolved

metascroy reviewed Jul 16, 2025

View reviewed changes

docs/source/llm/export-llm.md Outdated Show resolved Hide resolved

Update LLM documentation

1b2fd42

jackzhxng force-pushed the jz/export-llm-docs branch from 6fda1ef to 1b2fd42 Compare July 16, 2025 23:23

Scott comments

659db4f

mergennachin approved these changes Jul 17, 2025

View reviewed changes

Mergen comments

1a3e422

jackzhxng force-pushed the jz/export-llm-docs branch from 46cfe37 to 1a3e422 Compare July 19, 2025 00:21

shoumikhin reviewed Jul 19, 2025

View reviewed changes

	- [Running on iOS](llm/llama-demo-ios.md)
	- [Running on iOS](llm/run-on-ios.md)

Export LLM documentation revamp #12381

Are you sure you want to change the base?

Export LLM documentation revamp #12381

Uh oh!

Conversation

jackzhxng commented Jul 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Uh oh!

pytorch-bot bot commented Jul 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12381

❌ 3 New Failures

Uh oh!

github-actions bot commented Jul 10, 2025

This PR needs a release notes: label

Uh oh!

Uh oh!

jackzhxng left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

shoumikhin commented Jul 16, 2025

Uh oh!

jackzhxng commented Jul 16, 2025

Uh oh!

mergennachin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

shoumikhin commented Jul 18, 2025

Uh oh!

jackzhxng commented Jul 19, 2025

Uh oh!

shoumikhin Jul 19, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jackzhxng commented Jul 10, 2025 •

edited

Loading

pytorch-bot bot commented Jul 10, 2025 •

edited

Loading

This PR needs a `release notes:` label